Spherical perspective on learning with normalization layers

نویسندگان

چکیده

Normalization Layers (NLs) are widely used in modern deep-learning architectures. Despite their apparent simplicity, effect on optimization is not yet fully understood. This paper introduces a spherical framework to study the of neural networks with NLs from geometric perspective. Concretely, radial invariance groups parameters, such as filters for convolutional networks, allows translate steps L2 unit hypersphere. formulation and associated interpretation shed new light training dynamics. Firstly, first effective learning rate expression Adam derived. Then demonstration that, presence NLs, performing Stochastic Gradient Descent (SGD) alone actually equivalent variant constrained hypersphere, stems framework. Finally, this analysis outlines phenomena that previous variants act importance process experimentally validated.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

a head parameter survey on mazandarani dialect and its effect(s) on learning english from ca perspective (on the basis of x-bar syntax)1

there has been a gradual shift of focus from the study of rule systems, which have increasingly been regarded as impoverished, … to the study of systems of principles, which appear to occupy a much more central position in determining the character and variety of possible human languages. there is a set of absolute universals, notions and principles existing in ug which do not vary from one ...

15 صفحه اول

Electrophoretic Motion of Two Spherical Particles with Thick Double Layers

The electrophoretic mobilities of two interacting spheres are calculated numerically for arbitrary values of the double-layer thickness. A general formula for the electrophoretic translational and angular velocities of N interacting particles is derived for low-zeta-potential conditions. The present calculation complements the well-studied case of thin double layers. The results are compared wi...

متن کامل

Spherical Orbits’ Closures in Simple Projective Spaces and Their Normalization

Let G be a simply connected semisimple algebraic group over an algebraically closed field k of characteristic 0 and let V be a rational simple G-module. If G/H ⊂ P(V ) is a spherical orbit, set X = G/H ⊂ P(V ) its closure. Then we describe the orbits of X and of its normalization e X in terms of spherical systems and we give necessary and sufficient conditions so that the normalization e X → X ...

متن کامل

Spherical Orbit Closures in Simple Projective Spaces and Their Normalization

Let G be a simply connected semisimple algebraic group over an algebraically closed field k of characteristic 0 and let V be a rational simple G-module. If G/H ⊂ P(V ) is a spherical orbit, set X = G/H ⊂ P(V ) its closure. Then we describe the orbits of X and those of its normalization e X in terms of spherical systems and we give necessary and sufficient conditions so that the normalization e ...

متن کامل

Spherical indentation testing of poroelastic relaxations in thin hydrogel layers

In this work, we present the Poroelastic Relaxation Indentation (PRI) testing approach for quantifying the mechanical and transport properties of thin layers of poly(ethylene glycol) hydrogels with thicknesses on the order of 200 mm. Specifically, PRI characterizes poroelastic relaxation in hydrogels by indenting the material at fixed depth and measuring the contact area-dependent load relaxati...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Neurocomputing

سال: 2022

ISSN: ['0925-2312', '1872-8286']

DOI: https://doi.org/10.1016/j.neucom.2022.02.021